City Research Online Cisr at Inex 2006

نویسندگان

  • Wei Lu
  • Stephen Robertson
  • Andrew Macfarlane
چکیده

In this paper, we describe the Centre for Interactive Systems Research’s participation in the INEX 2006 adhoc track. Rather than using a fieldweighted BM25 model in INEX 2005, we revert back to using the traditional BM25 weighting function. Our main research aims in this year are to investigate the effects of document filtering by result record cut-off, element filtering by length cut-off and the effect of using phrases. The initial results show the latter two methods did not do well, while the first one did well on FOCUSED TASK and RELEVANT IN CONTEXT TASK. Finally, we propose a novel method for BEST IN CONTEXT TASK, and present our initial results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Window-based Enterprise Expert Search

This is the first year for the participation of the City University Centre of Interactive System Research (CISR) in the Expert Search Task. In this paper, we describe an expert search experiment based on windowbased techniques, that is, we build profile for each expert by using information around the expert’s name and email address in the documents. We then use the traditional IR techniques to ...

متن کامل

INEX 2006 Evaluation Measures

This paper describes the official measures of retrieval effectiveness employed at the ad hoc track of INEX 2006.

متن کامل

The Interactive Track at INEX 2006

In this paper we describe the planned setup of the INEX 2006 interactive track. As the track has been delayed and data collection has not been completed before the INEX 2006 workshop, the track will continue into 2007. Special emphasis is put on comparing XML element retrieval with passage retrieval, and on investigating differences between multiple dimensions of the search tasks.

متن کامل

INEX REPORT Report on the XML Mining Track at INEX 2005 and INEX 2006 Categorization and Clustering of XML Documents

This article is a report concerning the two years of the XML Mining track at INEX (2005 and 2006). We focus here on the classification and clustering of XML documents. We detail these two tasks and the corpus used for this challenge and then present a summary of the different methods proposed by the participants. We last compare the results obtained during the two years of the track.

متن کامل

The Heterogeneous Collection Track at INEX 2006

While the primary INEX test collection is based on a single DTD, it is realistic to assume that most XML collections consist of documents from different sources. This leads to a heterogeneity of syntax, semantics and document genre. In order to cope with the challenges posed by such a diverse environment, the heterogeneous track was offered at INEX 2006. Within this track, we set up a collectio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015